AITopics | speech engine

Collaborating Authors

speech engine

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Bringing Live Transcribe's Speech Engine to Everyone

#artificialintelligenceAug-28-2019, 01:56:29 GMT

Earlier this year, Google launched Live Transcribe, an Android application that provides real-time automated captions for people who are deaf or hard of hearing. Through many months of user testing, we've learned that robustly delivering good captions for long-form conversations isn't so easy, and we want to make it easier for developers to build upon what we've learned. Live Transcribe's speech recognition is provided by Google's state-of-the-art Cloud Speech API, which under most conditions delivers pretty impressive transcript accuracy. However, relying on the cloud introduces several complications--most notably robustness to ever-changing network connections, data costs, and latency. Today, we are sharing our transcription engine with the world so that developers everywhere can build applications with robust transcription. Those who have worked with our Cloud Speech API know that sending infinitely long streams of audio is currently unsupported.

artificial intelligence, live transcribe, transcribe, (14 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Speech (0.79)
Information Technology > Communications > Networks (0.55)

Add feedback

Google open-sources Live Transcribe's speech engine

#artificialintelligenceAug-18-2019, 03:44:57 GMT

The company hopes doing so will let any developer deliver captions for long-form conversations. The source code is available now on GitHub. Google released Live Transcribe in February. The tool uses machine learning algorithms to turn audio into real-time captions. Unlike Android's upcoming Live Caption feature, Live Transcribe is a full-screen experience, uses your smartphone's microphone (or an external microphone), and relies on the Google Cloud Speech API.

artificial intelligence, live transcribe, machine learning, (14 more...)

#artificialintelligence

AI-Alerts: 2019 > 2019-08 > AAAI AI-Alert for Aug 20, 2019 (1.00)

Industry: Information Technology (0.38)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.94)
Information Technology > Communications > Mobile (0.81)
Information Technology > Artificial Intelligence > Speech (0.58)

Add feedback

Text to speech Python Tutorial

#artificialintelligenceAug-12-2019, 20:09:04 GMT

We can make the computer speak with Python. Given a text string, it will speak the written words in the English language. This process is called Text To Speech (TTS). Pytsx is a cross-platform text-to-speech wrapper. It uses the Google Text to Speech (TTS) API.

speech engine, speech python tutorial, tts, (1 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Synthesis (1.00)
Information Technology > Artificial Intelligence > Assistive Technologies (1.00)

Add feedback

What the Voice-Recognition Industry Needs Most

AITopics Original LinksJan-18-2017, 10:32:12 GMT

I'm a big believer that voice-recognition technology will play an increasingly prominent role in how we interact with technology -- so much that I've made a bet on Nuance Communications (NAS: NUAN) accordingly as the clear technological leader in the field. So when the CEO of a small voice-recognition software company, Datria, reached out to me for a conversation, I jumped at the chance. Datria is a small private player with about 50 employees, and it resells Nuance's speech engine while also counting software giant SAP (NYS: SAP) as an investor. Jim Greenwell has been CEO for 12 years, and he provided valuable insight into the industry at large, as well as what trends users and investors should be on the lookout for. Who will rally the troops?

artificial intelligence, nuance, speech recognition, (20 more...)

AITopics Original Links

Country: Asia > China (0.05)

Industry: Information Technology > Software (0.91)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.86)
Information Technology > Artificial Intelligence > Speech > Acoustic Processing (0.86)

Add feedback